RAW SEQUENCE LISTING 
ERROR REPORT 



RECEIVED 
MAR 07 2002 
TECH CENTER 1600/2900 



The Biotechnology Systems Branch of the Scientific and Technical Information 
Center (STIC) detected errors when processing the following computer readable 
form: 



THE ATTACHED PRINTOUT EXPLAINS DETECTED ERRORS. 

PLEASE FORWARD THIS INFORMATION TO THE APPLICANT BY EITHER: 

1) INCLUDING A COPY OF THIS PRINTOUT IN YOUR NEXT COMMUNICATION TO THE 
APPLICANT, WITH A NOTICE TO COMPLY or, 

2) TELEPHONING APPLICANT AND FAXING A COPY OF THIS PRINTOUT, WITH A 
NOTICE TO COMPLY 

FOR CRF SUBMISSION QUESTIONS, PLEASE CONTACT MARK SPENCER, 703-308-4212. 

FOR SEQUENCE RULES INTERPRETATION, PLEASE CONTACT ROBERT WAX, 703-308-4216. 
PATENTIN 2.1 e-mail help: patin21help@uspto.gov or phone 703-306-4119 (R. Wax) 
PATENTIN 3.0 e-mail help: patin3help@uspto.gov or phone 703-306-4119 (R. Wax) 

TO REDUCE ERRORED SEQUENCE LISTINGS, PLEASE USE THE CHECKER 
VERSION 3.1 PROGRAM, ACCESSIBLE THROUGH THE U.S. PATENT AND 
TRADEMARK OFFICE WEBSITE. SEE BELOW FOR ADDRESS: 
http://www.uspto.gov/web/offices/pac/checker 

Applicants submitting genetic sequence information electronically on diskette or CD-Rom should be aware that there is 
a possibility that the disk/CD-Rom may have been affected by treatment given to all incoming mail. 
Please consider using alternate methods of submission for the disk/CD-Rom or replacement disk/CD-Rom. 
Any reply including a sequence listing in electronic form should NOT be sent to the 20231 zip code address for the 
United States Patent and Trademark Office, and instead should be sent via the following to the indicated addresses: 

1. EFS-Bio (<http://www.uspto.gov/ebc/efs/downloads/documents.htm>, EFS Submission 
User Manual - ePAVE) 

2. U.S. Postal Service: U.S. Patent and Trademark Office, Box Sequence, P.O. Box 2327, Arlington, VA 22202 

3. Hand Carry directly to: 

U.S. Patent and Trademark Office, Technology Center 1600, Reception Area, 7th Floor, Examiner Name, 
Sequence Information, Crystal Mall One, 1911 South Clark Street, Arlington, VA 22202 

U.S. Patent and Trademark Office, Box Sequence, Customer Window, Lobby, Room 1B03, Crystal Plaza Two, 
2011 South Clark Place, Arlington, VA 22202 

4. Federal Express, United Parcel Service, or other delivery service to: U.S. Patent and Trademark Office, 
Box Sequence, Room 1B03-Mailroom, Crystal Plaza Two, 2011 South Clark Place, Arlington, VA 22202 



Application Serial Number: 
Source: 

Date Processed by STIC: 




Or 



Revised 01/29/2002 



Raw Sequence Listing Error Summary 

ERROR DETECTED    SUGGESTED CORRECTION    SERIAL NUMBER: 09/993,179A 



1 ___ Wrapped Nucleics 
      Wrapped Aminos 

2 ___ Invalid Line Length 
      The rules require that a line not exceed 72 characters in length. This includes white spaces. 

3 _X_ Misaligned Amino Numbering 
      The numbering under each 5th amino acid is misaligned. Do not use tab codes between numbers; [illegible] 

4 ___ Non-ASCII 
      The submitted file was not saved in ASCII (DOS) text, as required by the Sequence Rules. Please 
      ensure your subsequent submission is saved in ASCII text. 

5 ___ Variable Length 
      [illegible] contain n's or Xaa's representing more than one residue. Per Sequence Rules, [illegible] 

6 ___ PatentIn 2.0 "bug" 

7 ___ Skipped Sequences (OLD RULES) 
      [illegible] 
      This sequence is intentionally skipped 
      Please also adjust the "(iii) NUMBER OF SEQUENCES:" [illegible] 

8 ___ Skipped Sequences (NEW RULES) 
      [illegible] 
      <400> sequence id number 
      000 

9 ___ Use of n's or Xaa's (NEW RULES) 
      [illegible] use of <220>-<223> is MANDATORY if n's or Xaa's are present. 
      In <220> to <223> section, please explain location of n or Xaa, and which residue n or Xaa represents. 

10 ___ Invalid <213> Response 

11 ___ Use of <220> 
      (See "Federal Register," 06/01/1998, Vol. 63, No. 104, pp. 29631-32) (Sec. 1.823 of Sequence Rules) 

12 ___ PatentIn 2.0 "bug" 
      Please do not use "Copy to Disk" function of PatentIn version 2.0. This causes a corrupted file, 
      resulting in missing mandatory numeric identifiers and responses (as indicated on raw sequence 
      listing). [illegible] 

13 ___ Misuse of n 

AMC/MH - Biotechnology Systems Branch - 08/21/2001 
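Two of the checks named in the error summary, the 72-character line limit (white space included) and the ASCII requirement, are mechanical enough to test before a diskette is resubmitted. The following is a minimal sketch only, not the Checker program referred to on the cover sheet; the function name check_listing_lines and the wording of its messages are assumptions made for illustration.

```python
def check_listing_lines(text, max_len=72):
    """Return (line_number, problem) pairs for a sequence listing held in a str.

    Hypothetical helper: it covers only the line-length and ASCII items
    of the error summary, nothing else.
    """
    problems = []
    for number, line in enumerate(text.splitlines(), start=1):
        # White space counts toward the limit, so the raw length is used.
        if len(line) > max_len:
            problems.append((number, "line longer than %d characters" % max_len))
        # str.isascii() is available from Python 3.7 onward.
        if not line.isascii():
            problems.append((number, "non-ASCII character"))
    return problems
```

Calling check_listing_lines on the text of a listing returns an empty list when neither problem is present.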
1636 



Input Set : A:\PTO.VSK.txt 

Output Set: N:\CRF3\02212002\I993179A.raw 



RAW SEQUENCE LISTING DATE: 02/21/2002 

PATENT APPLICATION: US/09/993,179A TIME: 11:37:48 

Does Not Comply 
Corrected Diskette Needed 

3 <110> APPLICANT: McCarthy, Sean A. 

4 Kuranda, Michael Joseph 

5 Bulawa, Christine Ellen 

6 Bossone, Steven 

8 <120> TITLE OF INVENTION: METHOD FOR IDENTIFYING GENES ENCODING SIGNAL SEQUENCES 

10 <130> FILE REFERENCE: 09404/032001 

12 <140> CURRENT APPLICATION NUMBER: US/09/993,179A 

13 <141> CURRENT FILING DATE: 2001-11-06 
15 <160> NUMBER OF SEQ ID NOS: 15 

17 <170> SOFTWARE: FastSEQ for Windows Version 3.0 



ERRORED SEQUENCES 

52 <210> SEQ ID NO: 2 

53 <211> LENGTH: 50 

54 <212> TYPE: PRT 



55 <213> ORGANISM: Homo sapiens 
57 <400> SEQUENCE: 2 



58 Met Lys Gly Thr Cys Val Ile Ala Trp Leu Phe Ser Ser Leu Gly Leu 
E--> 59 

60 Trp Arg Leu Ala His Pro Glu Ala Gln Gly Thr Thr Gln Cys Gln Arg 
E--> 61 20 25 30 

62 Thr Leu Glu Val Asn Ile Val Ser Pro Ser Ser Lys Ala Thr Phe Ser 
E--> 63 

64 Pro Ser 

65 50 

112 <210> SEQ ID NO: 4 

113 <211> LENGTH: 125 

114 <212> TYPE: PRT 

115 <213> ORGANISM: Homo sapiens 

117 <400> SEQUENCE: 4 

118 Met Arg Ser Leu Leu Arg Thr Pro Phe Leu Cys Gly Leu Leu Trp Ala 
E--> 119 1 5 10 15 

120 Phe Cys Ala Pro Gly Ala Arg Ala Glu Glu Pro Ala Ala Ser Phe Ser 
E--> 121 20 25 30 

122 Gln Pro Gly Ser Met Gly Leu Asp Lys Asn Thr Val His Asp Gln Glu 
E--> 123 35 40 45 

124 His Ile Met Glu His Leu Glu Gly Val Ile Asn Lys Pro Glu Ala Glu 
E--> 125 50 55 60 

126 Met Ser Pro Gln Glu Leu Gln Leu His Tyr Phe Lys Met His Asp Tyr 
E--> 127 65 70 75 80 

128 Asp Gly Asn Asn Leu Leu Asp Gly Leu Glu Leu Ser Thr Ala Ile Thr 



file://C:\Crf3\Outhold\VsrI993179A.htm 
E--> 129 85 90 95 

130 His Val His Lys Glu Glu Gly Ser Glu Gln Ala Pro Leu Glu Val Asn 
E--> 131 100 105 110 

132 Ile Val Ser Pro Ser Ser Lys Ala Thr Phe Ser Pro Ser 
E--> 133 115 120 125 

135 <210> SEQ ID NO: 5 
136 <211> LENGTH: 32 
137 <212> TYPE: PRT 
138 <213> ORGANISM: Mus musculus 
140 <400> SEQUENCE: 5 

141 Met Lys Gly Ala Cys Ile Leu Ala Trp Leu Phe Ser Ser Leu Gly Val 
E--> 142 1 5 10 15 

143 Trp Arg Leu Ala Arg Pro Glu Thr Gln Asp Pro Ala Lys Cys Gln Arg 
E--> 144 20 25 30 

146 <210> SEQ ID NO: 6 
147 <211> LENGTH: 45 
148 <212> TYPE: PRT 
149 <213> ORGANISM: Homo sapiens 
151 <400> SEQUENCE: 6 

152 Met Ser Pro Gln Glu Leu Gln Leu His Tyr Phe Lys Met His Asp Tyr 
E--> 153 1 5 10 15 

154 Asp Gly Asn Asn Leu Leu Asp Gly Leu Glu Leu Ser Thr Ala Ile Thr 
E--> 155 20 25 30 

156 His Val His Lys Glu Glu Gly Ser Glu Gln Ala Pro Leu 
E--> 157 35 40 45 

238 <210> SEQ ID NO: 14 
239 <211> LENGTH: 32 
240 <212> TYPE: PRT 
241 <213> ORGANISM: Homo sapiens 
243 <400> SEQUENCE: 14 

244 Met Lys Gly Thr Cys Val Ile Ala Trp Leu Phe Ser Ser Leu Gly Leu 
E--> 245 1 5 10 15 

246 Trp Arg Leu Ala His Pro Glu Ala Gln Gly Thr Thr Gln Cys Gln Arg 
E--> 247 20 25 30 

249 <210> SEQ ID NO: 15 
250 <211> LENGTH: 108 
251 <212> TYPE: PRT 
252 <213> ORGANISM: Homo sapiens 
254 <400> SEQUENCE: 15 

255 Met Arg Ser Leu Leu Arg Thr Pro Phe Leu Cys Gly Leu Leu Trp Ala 
E--> 256 1 5 10 15 

257 Phe Cys Ala Pro Gly Ala Arg Ala Glu Glu Pro Ala Ala Ser Phe Ser 
E--> 258 20 25 30 

259 Gln Pro Gly Ser Met Gly Leu Asp Lys Asn Thr Val His Asp Gln Glu 
E--> 260 35 40 45 

261 His Ile Met Glu His Leu Glu Gly Val Ile Asn Lys Glu Ala Glu Met 
E--> 262 50 55 60 

263 Ser Pro Gln Glu Leu Gln Leu His Tyr Phe Lys Met His Asp Tyr Asp 
E--> 264 65 70 75 80 




[Remainder of SEQ ID NO: 15 (residues 81-108) is illegible in the scanned copy.] 
<400> 1 

ggggaccgtg tttgtggccc ccaagccggt gccccccatt ttggaactca gcgagtaggg 60 
ggcggctctg gggaagtggc agggggcgca gcagctgctg cctccacttc cctagccagg 120 
tgctgaagag gatcttcgga gccgctctgg cccccaggcg ctggatgact ggcaccagcg 180 
ctcctcgcac ctgtgttggt gtgtgagact tgggctggag tgcccacgtg gctgtggagt 240 
cagtgtgatt catgattgag gaaacgcgtc ctccatcctc tctctccttg gcactttcca 300 
cacatgagga gaagaagagc ttctgtttag aagacacgtg cccagagtca gaggcccctt 360 
gcccacc atg aag gga acc tgt gtt ata gca tgg ctg ttc tca agc ctg 409 
        Met Lys Gly Thr Cys Val Ile Ala Trp Leu Phe Ser Ser Leu 





VERIFICATION SUMMARY 

PATENT APPLICATION: US/09/993,179A 

DATE: 02/21/2002 
TIME: 11:37:49 



Input Set : A:\PTO.VSK.txt 

Output Set: N:\CRF3\02212002\I993179A.raw 



L:12 M:270 C: Current Application Number differs, Replaced Current Application Number 

L:13 M:271 C: Current Filing Date differs. Replaced Current Filing Date 

L:41 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID:1 

L:45 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID:1 

L:49 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID:1 

L:59 M:332 E: (32) Invalid/Missing Amino Acid Numbering, SEQ ID:2 

M:332 Repeated in SeqNo=2 

L:81 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID:3 
L:85 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID:3 
L:89 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID:3 
L:93 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID:3 
L:97 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID:3 
L:101 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID:3 
L:105 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID:3 
L:109 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID:3 
L:119 M:332 E: (32) Invalid/Missing Amino Acid Numbering, SEQ ID:4 
M:332 Repeated in SeqNo=4 

L:142 M:332 E: (32) Invalid/Missing Amino Acid Numbering, SEQ ID:5 
M:332 Repeated in SeqNo=5 

L:153 M:332 E: (32) Invalid/Missing Amino Acid Numbering, SEQ ID:6 
M:332 Repeated in SeqNo=6 

L:209 M:257 W: Feature value mis-spelled or invalid, <221> Name/Key for SEQ ID#:11 

L:214 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:11 

L:245 M:332 E: (32) Invalid/Missing Amino Acid Numbering, SEQ ID:14 

M:332 Repeated in SeqNo=14 

L:256 M:332 E: (32) Invalid/Missing Amino Acid Numbering, SEQ ID:15 
M:332 Repeated in SeqNo=15 
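
The entries in the verification summary follow a regular pattern: a listing line number after "L:", a message code after "M:", and a single-letter class before the message text. A short sketch such as the one below could tabulate them when working through a report. The names ENTRY, parse_summary and count_by_class are hypothetical, and reading C, W and E as correction, warning and error is an assumption drawn from the messages themselves, not something the report states.

```python
import re
from collections import Counter

# One summary entry, e.g. "L:59 M:332 E: (32) Invalid/Missing ...".
# Lines without the "L:" prefix ("M:332 Repeated in SeqNo=2") are skipped.
ENTRY = re.compile(r"^L:(\d+)\s+M:(\d+)\s+([CWE]):\s*(.*)$")

def parse_summary(lines):
    """Turn summary lines into dicts; lines that do not match are ignored."""
    entries = []
    for line in lines:
        match = ENTRY.match(line.strip())
        if match:
            entries.append({
                "line": int(match.group(1)),     # line number in the raw listing
                "message": int(match.group(2)),  # numeric message code
                "cls": match.group(3),           # C, W or E (meaning assumed)
                "text": match.group(4),
            })
    return entries

def count_by_class(entries):
    """Count entries per class letter."""
    return Counter(entry["cls"] for entry in entries)
```

Filtering the parsed entries for class "E" gives the listing lines that carry the E--> marker in the errored-sequence pages.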






